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Abstract 

The RNA exosome is the major 3'-5' RNA degradation machine of eul<aryotic cells and participates in processing, surveillance 
and turnover of both nuclear and cytoplasmic RNA. In both yeast and human, all nuclear functions of the exosome require the 
RNA helicase MTR4. We show that the Arabidopsis core exosome can associate with two related RNA helicases, AtMTR4 and 
HEN2. Reciprocal co-immunoprecipitation shows that each of the RNA helicases co-purifies with the exosome core complex 
and with distinct sets of specific proteins. While Atl\/1TR4 is a predominantly nucleolar protein, HEN2 is located in the 
nucleoplasm and appears to be excluded from nucleoli. We have previously shown that the major role of AtMTR4 is the 
degradation of rRNA precursors and rRNA maturation by-products. Here, we demonstrate that HEN2 is involved in the 
degradation of a large number of polyadenylated nuclear exosome substrates such as snoRNA and miRNA precursors, 
incompletely spliced mRNAs, and spurious transcripts produced from pseudogenes and intergenic regions. Only a weak 
accumulation of these exosome substrate targets is observed in mtr4 mutants, suggesting that I\/1TR4 can contribute, but plays 
rather a minor role for the degradation of non-ribosomal RNAs and cryptic transcripts in Arabidopsis. Consistently, transgene 
post-transcriptional gene silencing (PTGS) is marginally affected in mtr4 mutants, but increased in heni mutants, suggesting 
that it is mostly the nucleoplasmic exosome that degrades aberrant transgene RNAs to limit their entry in the PTGS pathway. 
Interestingly, HEN2 is conserved throughout green algae, mosses and land plants but absent from metazoans and other 
eukaryotic lineages. Our data indicate that, in contrast to human and yeast, plants have two functionally specialized RNA 
helicases that assist the exosome in the degradation of specific nucleolar and nucleoplasmic RNA populations, respectively. 
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Introduction 

Efficient processing and degradation of RNA is a key process for 
tlie post-transcriptional control of gene expression. The main 3 '-5' 
RNA degradation machine of eukaryotic cells is the exosome, a 
multi-subunit complex found in both cytoplasm and nuclear 
compartments [1,2]. The exosome participates in a plethora of 
processing and degradation reactions, including the processing of 
ribosomal RNAs, snoRNAs and snRNAs, the turnover and quality 
control of mRNAs and the efficient elimination of RNA maturation 
by-products and diverse RNA species generated from non-genic 
regions [3-5] . In vivo, exosome activity requires the interaction of 
the exosome complex with associated RNA helicases. In yeast, 
cytoplasmic and nuclear exosomes are activated by the RNA 
helicases SKI2 and MTR4, respectively [6-8]. In both yeast and 
human MTR4 is an essential protein required for all functions of the 



nuclear exosome [9,10]. Interestingly, y4ra6iiio/'.s« lhaliana has two 
MTR4 homologues, designated AtMTR4 and HEN2 [1 1-13]. We 
have previously shown that AtMTR4 (encoded by Atlg59760) is a 
predominandy nucleolar protein required for the efficient degra- 
dation of misprocessed 5.8S rRNA precursors and specific 
fragments of the 5' external transcribed spacer (5' ETS), a by- 
product released during processing of three rRNAs from their 
common precursor transcript [13]. The requirement for AtMTR4 
in efficient rRNA production is reflected by the phenotype otmtr4 
mutants, which show a characteristic combination of developmental 
growth defects also observed in ribosomal protein mutants and in 
other Arabidopsis mutants lacking putative ribosome biogenesis 
factors such as nucleolin [14-19]. 

HEN2 {HUA enhancer 2, At2g06990) was originally identified 
in a genetic screen for mutations that enhance the flower 
morphology defects observed in hual and hua2 mutants [12]. A 
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Author Summary 

Cells rely on a number of RNA degradation pathways to 
ensure correct and timely processing and turnover of both 
coding and non-coding RNAs. Another important function 
of RNA degradation is the rapid elimination of mispro- 
cessed RNA species, maturation by-products, and non- 
functional RNAs that are frequently produced by pervasive 
transcription. The main 3'-5' RNA degradation machine in 
eukaryotic cells is the exosome, which is activated by 
cofactors such as RNA helicases. In yeast and human, 
processing, turnover and surveillance of all nuclear 
exosome targets depend on a single RNA helicase, MTR4. 
We show here that the Arabidopsls exosome complex can 
associate with two related RNA helicases, MTR4 and HEN2. 
IV1TR4 and HEN2 reside in nucleolar and nucleoplasmic 
compartments, respectively, and target different subsets of 
nuclear RNA substrates for degradation by the exosome. 
The presence of both IV1TR4 and HEN2 homologues in 
green algae, mosses and land plants suggest that the 
functional duality of exosome-associated RNA helicases is 
evolutionarily conserved in the entire green lineage. The 
emerging picture is that, despite a high degree of 
sequence conservation, intracellular distribution, activities 
and functions of exosome cofactors vary considerably 
among different eukaryotes. 

follow-up study showed that hen2 single mutants accumulate, as 
compared to wild type plants, slightly higher levels of a 
polyadenylated transcript comprising the two first exons and a 
large portion of the second intron of the AGAMOUS gene 
product, suggesting that the HEN2 protein could be involved in 
the degradation of misprocessed AGAMOUS transcripts [20]. 
These data and the strong homology with the exosome activator 
MTR4 prompted us to examine possible roles of HEN2 in 
exosome-mediated RNA degradation in Arabidopsis thaliana. In 
contrast to AtMTR4, HEN2 is not required for processing or 
degradation of 5.8S rRNA precursors or the elimination of the 5' 
ETS [13]. We show here that HEN2 is a nucleoplasmic protein 
that is associated with the Arabidopsis exosome core complex and 
has a specific role in the exosome-mediated degradation of non- 
coding RNAs, misprocessed mRNAs, introns and transcripts 
derived from retrotransposons and non-genic regions. Interestingly 
and as recendy reported for human MTR4 [9,21], HEN2 
associates with homologues of the NEXT (for Nuclear Exosome 
Targeting) complex components and also co-purifies with the cap- 
binding complex. MTR4, by contrast, is associated with a distinct 
set of proteins, many of which appear to be involved in ribosome 
biogenesis. Our results indicate a high degree of spatial and 
functional specialization of exosome activating RNA helicases in 
Arabidopsis. 

Results 

Both ATMTR4 and HEN2 are associated with plant 
exosome complexes 

Several co-factors of the Arabidopsis exosome such as RRP6L 1 
(RRP6-LIKE 1), RRP6L2, MTR4 and DIS3/RRP44 have been 
identified based on sequence homology with yeast counterparts 
and are genetically linked with nuclear exosome functions [13,22- 
25]. However, none of them has yet been shown to physically 
interact with the exosome. To better define the composition of 
plant exosome complexes, we used myc-tagged and GFP-tagged 
versions of the exosome core subunit RRP41 as baits in co- 
immunoprecipitation (IP) experiments. RRP41 fusion proteins 



were expressed under the control of the 1 kb genomic region 
upstream of the endogenous RRP41 gene. Both myc-tagged 
RRP41 and GFP-tagged RRP41 constructs were able to 
complement the otherwise lethal rrp41-X null mutation. A similar 
fuU phenotypic complementation was previously reported for 
TAP-tagged RRP41 [5]. Complementation of the nuH mutation 
by both RRP41-myc and RRP41-GFP suggested that both fusion 
proteins can be integrated in the core exosome complex. To test 
this hypothesis and identify potential exosome co-factors, tagged- 
RRP41 and associated proteins were affinity-purified using 
superparamagnetic particles coated with anti-myc or anti-GFP 
monoclonal antibodies, respectively. As shown for myc-tagged 
RRP4 1 IP, a specific group of proteins were visualized on silver- 
stained SDS-PAGE gel as compared with mock IP (Fig. 1). The 
proteins co-purifying with RRP41 were identified by mass 
spectrometry (nano LC-MS/MS) analysis. A final list of 14 
proteins was established by excluding proteins present in mock 
purifications and by crossing the datasets of three biological 
repeats (Table 1). All 14 proteins were identified with both Mascot 
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Figure 1. Purification of exosome complexes. Silver-stained SDS- 
PAGE of proteins co-immunoprecipitated with RRP41-myc. Similar 
results were obtained when GFP-tagged RRP41 was used as bait. Tfie 
three main bands observed in mock IP and also present in RRP41-myc IP 
correspond to immunoglobulins. 
doi:1 0.1 371/journal.pgen.1 004564.g001 
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and PEAKS DB algorithms with a false discovery rate < 1 % . An 
exhaustive list of the peptides shared by the three RRP41 IPs is 
shown in Table SI. 

As expected, all nine canonical core subunits of EX09 (RRP4 1 , 
RRP42, RRP43, RRP45B, RRP46, MTR3, CSL4, RRP4 and 
RRP40A) were identified which confirms that we indeed purified 
intact exosome complexes (Table 1, Table SI). In addition to the 
nine exosome subunits that were previously characterized [5] , five 
novel proteins were detected, albeit with lower number of spectra 
reflecting a lower abundance as compared to the canonical EX09 
subunits (Table 1, Table SI). 

(i) An isoform of RRP45B, RRP45A, was unambiguously 
identified by several discriminating peptides (given in bold 
in Table SI). In Arahidopsis , the RRP45 subunit is 
encoded by two genes: RRP45A and RRP45B/CER7 
[5,26]. Although single rrp45a and rrp45h mutants are 
both viable, a double knock out is lethal [26] . The rrp45h/ 
cer7 mutant is characterized by a defect in cuticular wax 
accumulation, which is not observed in rrp45a mutants, 
suggesting specialized functions for both isoforms [26]. 
Only RRP45B/CER7 was previously detected in EX09 
[5] , likely because of its higher expression level. Our results 
show that both isoforms are incorporated in plant exosome 
core complexes. 

(ii) A protein of unknown function RESURRECTION 1 
(RSTl) also consistently co-purified with EX09 (Table 1, 
Table SI). Interestingly, RSTl is crucial for cuticular wax 
accumulation, the very same biological process affected by 
lack of RRP45B [26,27]. The potential interaction 
between RSTl and EX09 is being assessed and is not 
the focus of this study. 

(iii) DIS3/RRP44 was also detected in all three biological 
replicates indicating that this exoribonuclease can associate 
with the plant EX09 core exosome as reported for yeast 
and animal exosomes [28-31]. However, as for the other 
putative co-factors that we detected by LC-MS/MS, 
DIS3/RRP44 is not visible on silver-stained gels (Fig. 1) 
and is detected with only few spectra by LC-MS/MS 
(Table SI), presumably reflecting a labile association to 
plant EX09 under the biochemical conditions used for 
immunoprecipitation experiments. Since none of the three 
RRP6L present in Arahidopsis was detected in our analysis, 
DIS3/RRP44 represents so far the only exoribonuclease 
whose physical interaction with plant EX09 can be 
inferred from mass spectrometry data. 

(iv) In all three replicates, discriminating peptides indicated 
that two RNA hehcases, AtMTR4 and HEN2, co-purified 
with EX09 (Table 1, Table SI). AtMTR4 was previously 
shown to participate in processing or degradation of 
ribosomal RNA precursors [13]. HEN2, a close relative of 
AtMTR4 was also suggested to play a role in RNA 
degradation [12,20], but its substrates or its role in 
exosome-mediated pathways have not been studied yet. 

HEN2 is a plant-specific member of the SKI2/MTR4 RNA 
helicase family 

AtMTR4 and HEN2 share 43% identity and 59-60% similarity 
with yeast MTR4 and with each other, but only 24% identity/ 
39% similarity with SKI2, a cofactor of the cytoplasmic exosome 
[7,32,33]. Structural modeling of HEN2 and AtMTR4 confirmed 
that both possess an arch domain, a characteristic feature of 
MTR4/SKI2 RNA helicases [34-37]. While the modeled 



structure of HEN2 matches closely to the structure of yeast 
MTR4, AtMTR4 has an insertion of nine amino acids in the RNA 
binding part of the arch domain, the KOW-motif [34,35]. 
Interestingly, a similar insertion is present in the KOW motifs of 
all plant MTR4 proteins investigated (Fig. SI). Other character- 
istic sequence differences between AtMTR4 and HEN2 concern 
RecA-domains, the arch domain and the C-terminal helix-loop- 
helix domain (Fig. S2), respectively, and allow the reliable 
discrimination of HEN2 and AtMTR4 homologues by sequence 
alignment algorithms. A search for homologues of AtSKI2, 
AtMTR4 and HEN2 in all genomes available at www. 
phytozome.net shows that all three RNA helicases are conserved 
throughout green algae, mosses and land plants. A phylogenetic 
analysis of related proteins from animals, fungi and other 
eukaryotic clades revealed that most organisms possess both a 
single MTR4 and a single SKI2 protein; however, HEN2 
homologues are restricted to the green lineage (Fig. 2). Despite 
of the short insertion in the KOW motif, plant MTR4 proteins 
cluster with the MTR4 proteins of other organisms, while HEN2 
proteins form a separate clade. Taken together, these data suggest 
that HEN2 is a plant-specific isoform of the nuclear exosome 
activator MTR4. 

AtlVlTR4 and HEN2 have distinct intranuclear localization 
patterns 

To extend previous localization studies [13], we transiently 
expressed HEN2 and AtMTR4 GFP fusion proteins in Nicotiana 
benthamiana leaves, alongside with RFP-labeled XRN2 and 
FibriUarin as nucleolar markers, and with SRP34 as a nucleoplas- 
mic marker [38-43]. Similar to XRN2-RFP and Fibrillarin-RFP, 
AtMTR4-GFP was detected in the nucleus, strongly enriched in 




Figure 2. HEN2 proteins are restricted to the green lineage. 

Phylogenetic tree of the MTR4/SKI family of RNA helicases. The basic 
branch that separates HEN2 homologues from MTR4 and SKI2 
homologues is detected with 1000/1000 bootstraps. Protein sequences 
were retrieved from metazome, phytozome and JGI databases and 
aligned with ClustalX. Dark red, vertebrates; light red, other eumetazoa; 
pink, M. brevicollis (Choanozoa); blue, 6. natans (Rhizaria); orange, N. 
gruberi (Heterobolosea); yellow, S. cerevisiae and S. pombe (Fungi); olive, 
F. cylindrus, P. tricornutum, T. pseudonana; P. cinammomi, P. sojae 
(Heterokonta); light green, green algae; green, mosses; blue-green, 
grasses; dark green, dicotyledons. Scale bar = 0.05 amino acid substi- 
tutions per site. 

doi:10.1371/journal.pgen.1004564.g002 
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nucleoli (Fig. S3). HEN2-GFP and SRP34-RFP were detected 
only in the nucleoplasm (Fig. S4). Next, we determined the 
intracellular localization of HEN2-GFP in root tips of stable 
Arabidopsis thaliana transformants. For comparison, we examined 
roots of plants expressing either AtMTR4-GFP or the exosome 
core subunits RRP4-GFP and RRP41-GFP. As expected, RRP4- 
GFP and RRP41-GFP were observed in both cytoplasm and 
nuclei, and enriched in nucleoli (Fig. 3, Fig. S5). As reported 
before, AtMTR4-GFP was observed predominantly in nucleoli, 
and only a faint signal was observed in the nucleoplasm (Fig. 3, 
Fig. S5) [13]. HEN2-GFP was observed in the nucleoplasm, was 
enriched in nuclear foci and appeared excluded from nucleoli 
(Fig. 3, Fig. S5, Fig. S6). These results show that HEN2 is a 
nucleoplasmic protein, and that AtMTR4 and HEN2 are for the 
most part located in distinct subnuclear compartments. 

AtlVlTR4 and HEN2 interact with different sets of proteins 

To investigate whether AtMTR4 and HEN2 are associated with 
specific proteins reflecting their distinct localization and to confirm 
that both helicases interact with EX09, we performed IP 
experiments using plant lines expressing GFP-tagged versions of 
AtMTR4 or HEN2 in their respective mutant backgrounds. The 
list of proteins co-purifying with AtMTR4-GFP or HEN2-GFP 
was established by considering only proteins that were not 
identified in mock purifications and common to replicate IPs for 
AtMTR4-GFP and HEN2-GFP experiments, respectively (Ta- 
bles 2 and 3; Tables S2 and S3). 

AU canonical nine subunits of EX09 were identified in both 
AtMTR4-GFP and HEN2-GFP datasets, which confirmed that 
both RNA helicases interact with the Arabidopsis exosome 
complex. Remarkably, EX09 subunits were the sole common 
proteins among the 43 and 16 significant proteins present in 
AtMTR4-GFP and HEN2-GFP IPs, respectively (Tables 2 and 3). 
A Gene Ontology (GO) analysis for the 34 proteins that were 
specifically co-purified with AtMTR4-GFP exposed that the most 
significant biological process GO term was ribosome biogenesis 
(Benjamini-Hochberg corrected p-value, 1.3E-12), which tagged 9 
out of 34 proteins. Further data mining revealed that additional 1 2 



proteins have a proven or predicted role in ribosome biogenesis 
(Table 2). Nine out of the 13 remaining proteins corresponded to 
transducin/WD40 repeat proteins and/or proteins involved in 
nucleic acid metabolism (Table 2, Table S2). These results are in 
agreement with our previous results [13] and further substantiate 
the role of AtMTR4 in maturation and/or degradation of 
ribosomal RNA. 

HEN2-GFP co-purified the nine canonical subunits of EX09, 
the alternative subunit RRP45A and 6 additional proteins 
(Table 3, Table S3). One of the six proteins that co-purified with 
HEN2-GFP was a homologxie of the exon junction complex (EJC) 
component MAGO NASHI. Two proteins were the subunits of 
the cap binding complex (CBC), CBP80 (AT2G 13540) and 
CBP20 (AT5G44200). Finally we identified three putative RNA 
binding proteins, two of which encoded by AT5G38600 and 
AT4G10110 had high spectral counts (Table 3, Table S3). 
AT5G38600 is a 532 amino acid protein containing the 
CX2CX4HX4C zinc-knuckle motif (Pfam 14392), particularly 
found in plant proteins [44]. A BLAST analysis against the 
human proteome identified ZCCHC8 as the best sequence 
homologue (E-value 4e-26). ZCCHC8 is a zinc -knuckle protein 
that was recentiy identified as part of the human Nuclear 
EXosome Targeting (NEXT) complex [9]. At4G10100 is a small 
protein of 173 amino acids that shares some similarity is related to 
the second component of the human NEXT complex, RBM 7, 
although the similarity is restricted to the N-terminal two-thirds of 
the 266 amino acids of RBM 7 (26% identity, 27% similarity for 
the ahgned sequence. Fig. S7). To further check whether HEN2, 
AT5G38600 and AT4G10110 form a NEXT-like complex in 
Arabidopsis, we slightly increased the stringency of HEN2 
immunoprecipitation conditions. By a modest increase of ionic 
strength from 50 to 150 mM NaCl, the co-purification of EX09 
with HEN2-GFP was lost. However, both AT5G38600 and 
AT4G 10110 were still present in duplicate immunoprecipitations 
(Table 3, Table S3). Interestingly, a third RNA-binding protein, 
AT1G67210, was identified in aU four HEN2 IPs, albeit with a 
lower spectral count (Table 3, Table S3). As AT5G38600, 
AT1G67210 also contains the CX2CX4HX4C zinc-knuckle motif 
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Figure 3. IVITR4 and HEN2 have distinct localization patterns. Distribution of GFP-fusion proteins in root cells of stable Arabidopsis 
transformants. No, Nucleolus; Np, Nucleoplasm; Cp, Cytoplasm. Scale bars 5 |im. 
doi:1 0.1 371/journal.pgen.1 004564.g003 
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(Pfaml4392) and both proteins share 49.6% identities and 57.6% 
similarities. These results support the existence of a NEXT-like 
complex in Arahidopsis and raise the interesting possibility that 
multiple NEXT-like complexes exist in plants. 

Taken together, our data show that AtMTR4 and HEN2 are 
associated with distinct sets of proteins. AtMTR4 co-purifies with 
the exosome, and with putative ribosome biogenesis factors, which 
highlights the function of AtMTR4 in pre-rRNA processing and 
degradation. HEN2 co-purifies with the exosome, the CBC 
complex and with two types of RNA-binding proteins to form a 
putative plant NEXT-like complex. These data suggest that the 
functional link between exosome, CBC and NEXT complexes that 
was recendy established in human cells [21] may be conserved in 
plants. Furthermore, these results strengthen the evidence for a 
functional specialization of HEN2 and MTR4. 

A selection of known exosome substrates accumulates In 
hen2 mutants 

So far, our data suggested that HEN2 might operate as cofactor 
of the nucleoplasmic exosome complex. In order to investigate the 
function of HEN2 for the degradation of exosome substrates, we 
tested the accumulation of a pseudogene and five non-coding 
RNAs selected from the list of known polyadenylated plant 
exosome substrates [5]. Targets comprised the pseudogene 

1 Atlg79245, the non-coding RNAs MRP and 7SL, the dicistronic 
c precursor of snoRNAs At3g58193 and At3g58196, a non-coding 

RNA encoded by At2g 18440, and intergenic transcripts generated 
■g from a repeat region located on chromosome 5 [5]. Additional 

information and hyperlinks to visualize the selected regions on the 

2 SALK transcriptome/exosome website (http://signal.salk.edu/) 
"g are provided in Fig. S8. Steady-state levels of the six selected 
"S exosome targets were determined by quantitative RT-PCR using 

2 oligo-dT primed cDNA samples prepared from seedlings of wild 
5 type, mtr4-l, mtr4-2, hen2-2 or hen2-4 mutant plants. We 
£ included also samples from RRP41 RNAi lines in which depletion 
^ of the exosome core subunit RRP41 is triggered by an inducible 

3 RNAi construct [5]. As shown in Fig. 4, all exosome substrates 

1 tested in this experiment were over-accumulated in hen2 samples 
§ as compared to wild type samples. By contrast, no or only a mild 
S accumulation was observed in mtr4 mutants. These data provided 

2 a first indication that HEN2 is involved in the degradation of 
i nuclear exosome targets that are not substrates of AtMTR4. 

D 

0 HEN2 participates in mRNA surveillance 

To evaluate the respective contribution of HEN2 and AtMTR4 
S to the degradation of nuclear exosome substrates in an unbiased 

^ manner, we determined the accumulation of polyadenylated 

S transcripts using fuU-genome (tiling) microarray arrays. For this 

1 experiment, cDNA was prepared from two biological replicates of 
§ wild type, mlr4-l and hen2-4 mutants. Each mutant sample was 
^ co-hybridized against a wild type sample to NimbleGen A. 
■~ thaliana 732K whole genome microarrays. The microarray chip 
S g contains 1,434,492 strand-specific probes covering both coding 
i J and non-coding regions with an average resolution of 175 nt. 

Probes are 45-85 nt long and designed to achieve a constant Tm 
of ~76°C to enhance hybridization consistency across probes. For 
each biological replicate, expression of each mutant was compared 
to the expression of the wild type. The statistical analysis, based on 
a 4-state Hidden Markov Chain, classified probes into four clusters 
corresponding to over-expressed probes, under-expressed probes, 
probes with unchanged expression, and noise (not expressed 
probes), respectively. Interestingly, the analysis did not declare any 
probe as under-expressed in both biological replicates. This result 
is in line with the prediction that loss of HEN2 and AtMTR4 
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Table 3. Proteins co-purified with HEN2-GFP at 50 mM and 150 mM ionic strength, and identified by LC-MS/IVIS. 
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EX09, Exosome core complex; NEXT, putative Nuclear EXosome Targeting Complex; CBC, Cap-Binding Complex; BC, Exon-Junction Complex. 
doi:l 0.1 371 /journal.pgen.l 004564.1003 



impair.s RNA degradation, and therefore results predominantly in 
an increased accumulation of RNA substrates. Indeed, signals for 
1860 unique probes were significantly increased in both biological 
replicates of hen2 samples. 499 probes were identified as 
overexpressed in both biological replicates of mtr4 samples. A file 
allowing the visuahzation of the upregulated probes aligned to the 
Arabidopsis genome can be found in dataset SI. For the further 
analysis, we sorted the probes according to their genome 
coordinates to identify upregulated regions. Only regions with at 
least two consecutive probes were considered for interpretation. 
Upregulated regions were then grouped with respect to annotated 
features taking into account both TAIRIO annotated genes and 
recendy identified genes encoding snoRNAs, miRNAs and 
lincRNAs [45^7]. This procedure identified 387 regions, the 
majority of which was upregulated exclusively in hen2 samples 
(Tables S4, S5, S6, S7, S8, S9, SIO, Sll). 

237 of the upregulated regions mapped to protein coding genes. 
However, for the majority of tlie cases, (149 regions, 112 of which 
were only observed in hen2 samples. Table S4), the upregulated 
transcripts were apparendy not mature mRNAs. In fact, most of the 
upregulated regions corresponded to short portions of protein coding 
genes (Table S4). The upregulation of short regions located in the 3' 
portion of protein coding genes was validated by qRT-PCR analysis 
for diree examples (Fig. 5). As a positive control of exosome-mediated 
RNA degradation, we used the RRP41 RNAi line. In aU three cases, 
we indeed observed the accumtilation of transcripts corresponding to 



3' regions in both the hen2 and the rrp41 samples (Fig. 5). Another 
group of upregulated regions mapped within the body of the 
transcripts and beyond mature 3' ends (Fig. 6, Table S4), indicative 
of alternative 3' end processing or readthrough transcription. 
Furthermore, many upregulated regions contained both exonic and 
intronic sequences, suggesting the accumulation of incompletely 
spliced transcripts. To test this possibility, we compared by qRT-PCR 
the steady-state levels of individual exons, introns, regions comprising 
unspliced intron-exon junctions and correctly spliced transcripts from 
selected loci (Fig. 7, Fig. S9). These experiments conftrmed the 
overaccumulation of transcripts comprLsing unspliced donor or 
acceptor sites in two independent T-DNA insertion alleles of 
HEN2 and in RRP41 RNAi plants (Fig. 7A, Fig. S9). For most 
loci, we detected the upregulation of both unspliced and spliced 
transcripts (albeit to different levels, please note the scales in Fig. 7 
and S9), suggesting that heterogeneous transcripts are produced and 
targeted for degradation. In order to map the 3' extremities of the 
unspliced transcripts, we amplified and sequenced transcripts derived 
from tiie Atlg79270 locus by 3' RACE-PCR (Fig. 7B). cDNA 
synthesis was initiated by oligo dT, and PGR products were amplified 
with nested forward primers situated in the 3' region of the first exon, 
and the adapter sequence of cDNA synthesis primer as a reverse 
primer. All PGR products amplified from WT, mtr4 or RRP41 
control plants corresponded to the fuUy spliced mature mRNA 
(Fig. 7B). By contrast, a smaller product of only about 500 bp was 
amplified from hen2 or RRP41 RNAi plants, and corresponded to a 
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Figure 4. Loss of HEN2 results in over-accumulation of selected 
exosome targets. Steady-state levels of exosome targets selected 
from [5], (see also Fig. S8) in hen2 and mtr4 mutant seedlings were 
determined by qRT-PCR. Samples from an inducible RRP41 RNAi line 
grown in absence {RRP41 Ctrl) or presence {RRP41 RNAi) of estradiol 
were included as controls. The histogram shows the fold change 
relative to wild type. mtr4-1 in red, mtr4-2 in orange, hen2-2 in light 
green, hen2-4 in dark green, RRP41 control in light grey, RRP41 RNAi in 
dark grey. Error bars = SD in three biological replicates. 
doi:1 0.1 371 /journal.pgen.1 004564.g004 



population of transcripts that comprised the unspliced donor site of 
the first intron (Fig. 7B, Fig. SIO), while 3' extremities were close to 
the acceptor site. Of the 20 clones that were obtained from the hen2- 
4 sample 18 were polyadenylated at or close to the intron acceptor 
site (Fig. SIO), indicating that they are indeed marked for degradation 
by the nuclear exosome. The remaining 2 clones had polyadenylation 
sites 8 and 52 nt upstream of the acceptor site and likely represent 
degradation intermediates (Fig. SIO). Hence, both qRT-PCR and 3' 



RACE results confirmed the results of the microarray analysis and 
show that polyadenylated transcripts with incorrectly spliced introns 
accumulate in hen2 plants. 

The tiling data suggested that HEN2 also participates in the 
elimination of excised introns. In fact, 34 of the 237 protein coding 
regions that were detected in the microarray analysis corresponded 
exclusively to intron regions (Table S5), 28 of which were only 
observed upon loss of HEN2. As for incompletely spliced 
transcripts, qRT-PCR experiments confirmed the accumulation 
of intronic regions in two independent alleles of hen2 and in 
RRP41 RNAi samples (Fig. 8). Finally, 54 regions detected by the 
tUing analysis likely corresponded to mature mRNAs since they 
were regions with all probes in exons, and the detected regions 
spanned at least 50% of the mRNA. 33 mRNAs were detected in 
hen2, 9 mRNAs in 7ntr4, and 12 in both hen2 and mtr4 samples 
(Table S6). The upregxilation of the pseudogene Atlg79245 
(Fig. 4) and several other mRNAs in hen2 and mtr4 samples was 
vahdated by qRT-PCR (Fig. SI 1). However, not all of the tested 
mRNAs were also found upregulated in RRP41 RNAi samples 
(Fig. Sll). Moreover, none of the positively tested mRNAs was 
previously identified as an exosome-regulated mRNA [5] . Finally, 
some mRNAs were only detected in the very same samples that 
have been used for the microarray, but not in other mutant plants 
grown in the same culture conditions (Fig. SI 2). This inconsistence 
was in sharp contrast to all other types of substrates that were 
tested in the course of the study, which were reproducibly detected 
in all replicates, in independent hen2 T-DNA insertion mutants, 
and in RRP41 RNAi lines. Therefore, we doubt that all of the 
mRNAs that were detected in our tiling analysis represent true 
substrates of exosome-mediated decay. Although nuclear degra- 
dation can probably contribute to mRNA degradation [3,5,48], 
the upregulation of mRNAs can also be explained by indirect 
effects, e.g. a diflFerential response of WT and mutant plants to 
growth conditions. Indeed, data mining revealed that many of the 
mRNAs detected by the tiling arrays are linked to stress response 
(Table SI 2). Hence, the majority of the mRNAs that we detected 
by the tiling array are probably not bona fide substrates of HEN2 
or the nuclear exosome. By contrast, the upregulation of short 
mRNA-derived transcripts (Fig. 5, Table S4), 3' extended mRNAs 
(Fig. 6, Table S4), unspliced transcripts, (Fig. 7, Table S4, Fig. S9, 
Fig. SIO), introns (Fig. 8, Table S5) is consistently detected in all 
replicates of both mutant alleles of hen2 and in RRP41 RNAi 
samples, and can be considered as bona fide substrates of the 
nuclear exosome and the RNA helicase HEN2. 

Taken together, our data indicate that loss of HEN2 results in the 
accumulation of short transcripts derived from mRNA regions, 3' 
extended transcripts, incompletely spliced mRNA transcripts, and 
excised introns. The polyadenylated status of the accumulated 
transcripts indicates that they are tagged for degradation by the 
exosome, and indeed, all these classes of mRNA-derived transcripts 
have been described as exosome targets [5]. Hence, the most 
straightforward explanation for the accumulation of these tran- 
scripts in hen2 mutants is that HEN2 is required for the exosome- 
mediated elimination of different types of probably unfunctional 
RNAs that are generated from protein coding genes. Only a small 
number of such transcripts were observed in mtr4 mutants (Table 
S4, S5) and accumulated at lower levels (Fig. 5-8, Fig. S9), 
indicating that MTR4, as compared with HEN2, has a rather minor 
contribution to nuclear mRNA surveillance. 

HEN2 is the major RNA helicase for the degradation of 
non-coding nuclear exosome substrates 

Next, we examined the contribution of HEN2 and MTR4 to 
the degradation of non-coding transcripts. Of the 387 upregulated 
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Figure 5. hen2 mutants accumulate short transcripts derived 
from mRNA genes. qRT-PCR analysis. A diagram of the genomic locus 
indicated by the respective AGI number is shown at the top of each 
panel. Annotated mRNA genes are represented as arrows with dark 
blue boxes for the CDS, light blue boxes for 3' and 5' UTRs, and a light 
blue line for introns. Red bars above the diagram represent probes 
detected in the tiling analysis. Green arrows above or below the 
diagram depict the location of qRT-PCR primers. The corresponding 
qRT-PCR results for each primer pair are given as fold-change relative to 
WT in the histograms below each diagram. mtr4-l in red, mtr4-2 in 
orange, hen2-2 in light green, hen2-4 in dark green, RRP41 control in 



light grey, RRP41 RNAi in dark grey. Error bars = SD in three biological 
replicates. 

doi:10.1371/journal.pgen.1004564.g005 

regions detected in the microarray analysis, 150 regions mapped to 
non-coding regions. 9 regions mapped to transposable elements (6 
in hen2, ,3 in mtr4) and were not further investigated (Table S7). 45 
regions, all of which were exclusively observed in hen2 mutants, 
contained one or more snoRNA genes (Table S8), including the 
regions encoding snoRNAs At3g58193 and At3g58196 (Fig. 4). In 
fact, almost all of the snoRNA regions detected in our tUing array 
have also been previously identified as exosome substrates (see 
Table S8 last column, and [5]). To further investigate the 
contribution of HEN2 to the degradation of snoRNA precursors, 
we tested two additional snoRNA regions by qRT-PCR (Fig. 9). 
The results indicated a strong accumulation of snoRNA precursors 
in both hen2 mutant lines and in RRP41 RNAi samples. The 
preferential accumulation of snoRNA precursor transcripts was 
further confirmed by transferring 3' RACE-PCR products to 
membranes followed by hybridisation with radiolabelled probes 
(Fig. SI 3). These data strongly indicate that HEN2, but not 
MTR4, plays an important role for the degradation of snoRNA 
precursors. 

Other non-coding RNA regions were also unequally distributed 
between hen2 and mtr4 samples. 22 regions in hen2 samples 
encoded lincRNAs, putative miRNA precursors or other non- 
coding RNAs (Table S9), among them a portion of the region 
encoding GUT 15 (for gene with unstable transcript 15)/ 
At2gl8440 (Fig. 4). Only 2 of such non-coding RNA regions 
were observed in mtr4 samples (Table S9), indicating that MTR4 
plays a minor role for the degradation of non-ribosomal non- 
coding RNAs. Similarly, we detected 29 putative antisense 
transcripts in hen2 samples, while only 2 potential antisense 
regions were upregulated in mtr4 samples (Table SIO). To confirm 
the polyadenylated status of antisense transcripts, we selected one 
of the potential antisense regions for 3' RACE experiments. 
Cloning and sequencing of the PCR products revealed that 0 of 32 
clones obtained from WT samples corresponded to the target 
sequence (Fig. 10, Fig. S14). By contrast, 28 of 32 clones obtained 
from hen2 samples and 3 of 32 clones obtained from mtr4 samples 
corresponded indeed to antisense transcripts derived from the 
target region (Fig. 10, Fig. SI 4). Antisense sequences were 
polyadenylated, a hallmark of exosome-mediated RNA degrada- 
tion, and between 67 and 208 nt long (Fig. 10, Fig. S14). These 
data strongly suggest that the antisense transcripts derived from 
the At5g44306 locus are indeed substrates of polyadenylation- 
mediated decay facilitated by HEN2 and the RNA exosome. 
Finally, the microarray analysis detected 43 regions without any 
annotated genome features, including the intergenic repeat region 
on chromosome five that was already detected by our initial qRT- 
PCR experiments (Fig. 4). Similar to the distribution of non- 
coding RNA regions and potential antisense transcripts, the 
majority of the non-annotated regions (38 of 42) were exclusively 
observed in hen2 samples, while only 4 of 42 regions were found in 
mtr4 samples. These results indicate that the elimination of 
spurious transcripts generated from antisense or non-annotated 
regions that are usually described as the "dark matter" of the 
transcriptome relies mosdy on HEN2. 

Taken together, the microarray analysis revealed that a large 
number of exosome targets, including short or incompletely 
spliced transcripts derived from mRNA genes, precursors and 
processing by-products of non-coding RNAs, and spurious 
transcripts generated from antisense and intergenic regions 
accumulate specifically in hen2 mutants (Fig. 1 1). This indicates 
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Figure 6. Accumulation of 3' extended mRNAs in hen2 mutants. 

qRT-PCR experiments to test the upregulation of regions that mapped 
within the body and beyond annotated 3' ends of mRNAs. Please see 
legend of Fig. 5 for a detailed explanation of the diagrams. 
doi:10.1371/journal.pgen.1004564.g006 

that HEN2 has a major function in the elimination of many 
different types of nuclear exosome substrates. A much smaller 
number of such non-ribosomal exosome substrates accumulated in 
mtr4 plants (Fig. 1 1), and average accumulation levels in mtr4 
samples were lower than in hen2 samples (Fig. SI 5). These data 
indicate that AtMTR4, though it can apparently contribute at 
least to the degradation of mRNA-derived transcripts, plays a 
rather minor role in nuclear RNA surveillance. 

HEN2 has an antagonistic effect on post-transcriptional 
gene silencing mediated by a sense transgene (S-PTGS) 

Previous data indicated that mutations in 5'-3' exoribonucleases 
in nucleoli (XRN2), nucleoplasm (XRN3) or cytoplasm (XRN4) 
enhance the efficiency of post-transcriptional gene silencing 
mediated by sense transgenes (S-PTGS) [49]. Mutations in the 
exosome core components RRP4 and RRP4 1 , or mutations in the 
exosome co-factors RRP44/DIS3 and RRP6L1 also enhance S- 
PTGS, suggesting that both 5'-to-3' and 3'-to-5' RNA degrada- 
tion pathways limit the entry of aberrant transgene RNAs into the 
S-PTGS pathway [50]. To determine in which nuclear compart- 
ment the exosome counteracts transgene PTGS, we analyzed the 
effect of mtr4 and hen2 mutations on S-PTGS using the 
Arabidopsis reporter line Hcl [49-52]. Line Hcl carries a 
35S::GUS transgene that triggers S-PTGS at a frequency of 
20% at each generation, making this line ideal for identifying 
mutations that either increase or decrease S-PTGS efficiency. 
However, only EMS mutants or 35S CaMV promoter-free T- 
DNA insertion mutants are amenable for such analyses because 
copies of the 35S CaMV promoter present in the SALK, WISC or 
GABI T-DNA collections often interfere with the expression of 
35S CaMV promoter-driven transgenes, which could report an 
impact on S-PTGS that is not directly related to the function of the 
mutated gene [53]. Only the T-DNA mtr4-2 [13] and the EMS 
hen2-l mutant [12] fit this requirement. Hcl/mtr4-2 plants 
triggered S-PTGS at a frequency of 25% (n = 96), which is only 
slightly higher than S-PTGS frequency in Hcl controls (Fig. 12A). 
These data indicate that compromising exosome activity in 
nucleoli has only a limited effect on transgene S-PTGS. For 
comparison, loss of the nucleolar exoribonuclease XRN2 was 
previously shown to trigger S-PTGS at a frequency of 47% [49] 
(Fig. 12A). In contrast, compromising the nucleoplasmic 5'-to-3' 
exonuclease XRN3 had a stronger effect (Fig. 12A) [49]. To 
determine if loss of the nucleoplasmic protein HEN2 also affects S- 
PTGS, the hen2-l mutation, which is in the Landsberg erecta 
(Ler) ecotype, was crossed to an Hcl/Ler line resulting from ten 
backcrosses of Hcl to Ler. Remarkably, no Hcl/Ler plants 
exhibited S-PTGS (n = 96, Fig. 12B), suggesting that either L«r is 
less prone to trigger S-PTGS than Col or that Hcl has lost its 
capacity to trigger S-PTGS after ten backcrosses to Ler. This later 
hypothesis was ruled out by crossing Hcl/Ler to xrn4-l (in Ler). 
Mutations in the cytoplasmic 5'-to-3' exonuclease XRN4 are 
known to enhance S-PTGS in Col [49] (Fig. 12A), so S-PTGS was 
expected to occur in Hcl/xrn4-l/Ler plants if the Hcl locus has 
retained its ability to trigger S-PTGS in Ler. S-PTGS was 
observed in 100% oi Hcl/xrn4-l/Ler plants (n = 96, Fig. 12B), 
indicating that Ler is less prone to trigger S-PTGS than Col, but 
that the Hcl/Ler line still is amenable to identify mutations that 
enhance S-PTGS. Indeed, 23% oi Hcl/hen2-l/Ler plants (n = 83) 
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Figure 7. hen2 mutants accumulate incompletely spliced 
transcripts. A qRT-PCR. The diagram shows the At1g79170 locus. 
The At1g79170 mRNA is represented as an arrow with dark blue boxes 
for the CDS, light blue boxes for 3' and 5' UTRs, and a light blue line for 
introns. Red bars above the diagram represent probes detected in the 
microarray analysis. Green arrows above or below the diagram depict 



the location of qRT-PCR primers. The corresponding qRT-PCR results for 
each primer pair are given as fold-change relative to WT in the 
histograms below each diagram. Please note the scales. mtr4-l in red, 
mtr4-2 in orange, hen2-2 in light green, hen2-4 in dark green, RRP41 
control in light grey, RRP41 RNAi in dark grey. Error bars = SD in three 
biological replicates. B 3' RACE PCR. The diagram shows the Atl g791 70 
locus. Forward primers used for 3' RACE-PCR on oligo-dT-primed cDNA 
are shown as green arrows above the diagram. A negative stain of PCR 
products separated on a 1.5% agarose gel is shown on the bottom. The 
upper band marked by a star corresponded to the fully spliced mRNA as 
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located at or upstream of the 3' acceptor site and were polyadenylated 
(see Fig. S10). 

doi:10.1371/journal.pgen.1004564.g007 

exhibited S-PTGS (Fig. 12B), indicating that compromising 
exosome activity in the nucleoplasm strongly enhances transgene 
S-PTGS. The antagonistic efiect of HEN2 on S-PTGS was 
confirmed in Col using an AGOl transgenic reporter system. In 
this system, transformation of wild type Col with a T-DNA 
carrying an ectopic pAGOl ::AG01 construct triggers cosuppres- 
sion (S-PTGS) of endogenous AGOl in 50% of the transformants 
[54] (Fig. 12C). Transformation oi hen2-2 (in Col) with the same 
pAG01::AG01 construct triggered AGOl cosuppression in 72% 
of the transformants (Fig. 12C). This increase is almost compara- 
ble to the effect of xrn4 on AGOl cosuppression (Fig. 12C), 
indicating that hen2 strongly affects S-PTGS in Col. Taken 
together, these data show that HEN2 counteracts S-PTGS in both 
Col and Ler, likely through its role as a co-factor of the 
nucleoplasmic exosome. 

Discussion 

We show here that two isoforms of MTR4, AtMTR4 and 
HEN2, assist the Amhidopsis exosome for the degradation of 
mostly distinct sets of nuclear RNA substrates. Both AtMTR4 and 
HEN2 co-purify with the Arahidopsis exosome core complex but 
AtMTR4 and HEN2 occupy primarily distinct intranuclear 
compartments. The main role of the nucleolar isoform AtMTR4 
is to assist in the exosome-mediated degradation of misprocessed 
rRNA precursors and maturation by-products [13]. Our new data 
show that the main function of the nucleoplasmic isoform HEN2 is 
to assist the exosome-mediated processing and/or degradation of 
snoRNAs and snoRNA precursors, mlRNA precursors, lincRNAs, 
and a large number of spurious transcripts derived from antisense 
and non-annotated regions. In addition, HEN2 is involved in the 
degradation of excised introns and incompletely sphced or 
otherwise mis-transcribed or mis-processed mRNAs. Of the 387 
regions detected in this study, 1 00 have been previously identified 
as targets of the Ar'ahidopsis core exosome (Tables S4, S5, S6, S7, 
S8, S9, SIO, Sll) using a different type of tiling microarray [5]. 
However, our qRT-PCR data indicate that majority of the HEN2 
substrates accumulate also in RRP41 RNAi samples, even though 
some of them were not detected previously [5], such as Atlg79270 
(Fig. 6), At3g26510 (Fig. S9) and Atlg58602 (Fig. S9). Vice versa, 
several known exosome substrates were not identified by our tiling 
analysis but easily detected by qRT-PCR (Fig. S16). These 
findings indicate that both tiling studies probably underestimate 
the contribution of HEN2 and the exosome to RNA surveillance. 
Only a small number of exosome substrates are observed in mtr4 
mutants, which indicates that AtMTR4 participates, but plays only 
a minor role in nuclear RNA surveillance. This is in line with the 
finding that AtMTR4 and HEN2 have marginal and strong 
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Figure 8. hen2 mutants accumulate regions of excised introns. 

qRT-PCR experiments to validate the upregulation of regions located 
within introns. Primers pairs V62a, V64a/b and V70ab are exon spanning 
and produce amplicons only from spliced mRNAs under the conditions 
used for qRT-PCR. Please refer to the legend of Fig. 5 for a detailed 
explanation of the diagrams. 
doi:10.1371/journal.pgen.1004564.g008 

contributions, respectively, to the exosome activity that likely 
degrades aberrant transgene RNAs in the nucleus to limit their 
entry in the PTGS pathway [49,50]. 
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Figure 9. hen2 mutants accumulate snoRNA precursors. qRT- 
PCR. A diagram of the genomic locus with the indicated snoRNA genes 
is shown at the top of each panel. Individual snoRNA genes are 
represented as yellow arrows. Red bars above the diagram represent 
probes detected in the tiling analysis. Green arrows above or below the 
diagram depict the location of qRT-PCR primers. The corresponding 
qRT-PCR results for each primer pair are given as fold-change relative to 
WT in the histograms below each diagram. mtr4-l in red, mtr4-2 in 
orange, hen2-2 in light green, hen2-4 in dark green, RRP41 control in 
light grey, RRP41 RNAi in dark grey. Error bars = SD in three biological 
replicates. 

doi:10.1371/journal.pgen.1004564.g009 
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Figure 10. Loss of HEN2 results in accumulation of polyade- 
nylated antisense transcripts. 3' RACE-PCR. A diagram of the 
At5g44306 locus is shown at the top. A blue arrow represents the 
At5g44306 mRNA. Red bars above the diagram represent probes 
detected in the tiling analysis. The location of the primer used for 3' 
RACE PCR is indicated by a green arrow above the diagram. Each of the 
orange horizontal bars below the diagram represents a polyadenylated 
clone obtained from the indicated sample. 28 of 32 clones obtained 
from hen2 samples and 3 of 32 clones obtained from mtr4 samples 
corresponded to antisense transcripts of 67 to 208 nt lenght. 0 of 32 
clones obtained from WT samples corresponded to the target 
sequence. 

doi:10.1371/journal.pgen.1004564.g010 

In a previous study, we have shown that hen2 mutants do not 
accumulate the 5.8S rRNA precursors and the 5' ETS that are 
observed upon down-regulation of AtMTR4 [13]. Accordingly, 
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Figure 11. Degradation of non-ribosomal exosome targets 
depends largely on HEN2. The transcriptomes of WT, hen2-4 and 
mtr4-l plants were compared by whole genome microarrays. The 
histogram shows the total number of regions that were, as compared to 
wild type, overaccumulated in hen2 or mtr4 samples, respectively. A 
complete list of the upregulated regions can be found in Tables S4, S5, 
S6, S7, S8, S9, S10, S11. 
doi:10.1371/journal.pgen.1004564.g011 

hen2 single mutants do not display any of the developmental 
defects linked to disturbed ribosome biogenesis or ribosome 
function that were observed in mtr4 mutants. These data were the 
first clues that AtMTR4 and HEN2 have rather distinct functions 
in plants. However, we were not able to obtain double mtr4 hen2 
mutants, signifying that simultaneous loss of both AtMTR4 and 
HEN2 is lethal [13]. The data presented here suggest that 
AtMTR4 can perform some of the functions of HEN2. For 
instance, a limited number of mRNA-derived fragments, non- 
coding RNAs and spurious transcripts accumulated in mtr4 single 
mutants, indicating that AtMTR4 can contribute, even in 
presence of HEN2, to the degradation of non-ribosomal exosome 
targets. In line with this, a small fraction of AtMTR4-GFP can be 
detected in the nucleoplasm of stable Arabidopsis transformants 
(Fig. SI 7, see also [13]). Taking together, these results indicate that 
a limited overlap between AtMTR4 and HEN2 functions exists. 
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Figure 12. Effect of mtr4 and /7e/72 mutations on posttranscrip- 
tional silencing. Diagrams show the proportion of plants that 
undergo systemic silencing of a GUS (panels A and B) or AG01 (panel 
C) reporter transgene in the indicated mutants and backgrounds. The 
color code indicates the intracellular localization of the mutated 
proteins: red marks the nucleolar proteins; green marks nucleoplasmic 
proteins, and blue mark cytoplasmic proteins. A. Loss of MTR4 has only 
a marginal effect on GUS S-PTGS. The effect of the mtr4-2 mutation on 
silencing of a GUS reporter transgene was tested in the Hcl/Col 
reporter system as described in [51]. For comparison, we included 
previously published data from [49] that show the effects of 
compromised 5'-3' exoribonucleases XRN2 (nucleolar), XRN3 (nucleo- 
plasmic) or XRN4 (cytoplasmic) in the same reporter system. B. Loss of 
HEN2 triggers silencing of a GUS PTGS reporter. The effect of hen2-1 (in 
Ler) on GUS-PTGS was tested in an Hcl -reporter line backcrossed to Ler. 
Please note that the Hcl /Ler reporter line does not trigger silencing of 
the GUS reporter in WT. xm4 in Hcl/Ler was used as a positive control. 
C. Loss of HEN2 triggers co-suppression of AG01 . To further confirm the 
role of HEN2 as silencing suppressor, WT, hen2-2 and xm4 plants (in Col) 
were transformed with a pAG0h:AG01 construct that triggers 
cosuppression of the endogenous AG01 gene. 
doi:1 0.1 371 /journal.pgen.1 004564.g01 2 



However, our data show that most exosome functions are 
activated either by AtMTR4 or by HEN2 in the nucleolus and 
the nucleoplasm, respectively. 

It is interesting to note that Arahidopsis has also specific 
nucleoplasmic and nucleolar isoforms (RRP6L1 and RRP6L2, 
respectively) of RRP6, a catalytically active exoribonuclease 
associated with nuclear exosomes in yeast and human [28,30]. 
So far, we and others [5] did not detect any of the plant RRP6-like 
proteins in plant exosome preparations. However, we have 
previously shown that the downregulation of the nucleolar isoform 
RRP6L2 leads to a mild accumulation of misprocessed 5.8S rRNA 
precursors and the 5' ETS, suggesting that RRP6L2 acts in the 
same degradation processes as AtMTR4 [13,55]. By contrast, we 
did not detect a significant overaccumulation of HEN2 targets 



upon down-regulation of the nucleoplasmic isoform RRP6L1 
(data not shown). A possible explanation is that RRP6L2 and 
RRP6L 1 can substitute for each other in the degradation of HEN2 
targets, since the two nuclear RRP6-like proteins appear to have 
both specific and common roles linked to the degradation of 
exosome targets [24,55]. Interestingly, a recent study revealed that 
RRP6L1, but not RRP6L2 has also a role in transcriptional 
silencing by retaining PolV transcripts on chromatin, thereby 
promoting the production of 24 nt siRNAs that direct DNA 
methylation via the RdDlVI pathway [56]. Remarkably, this 
function of RRP6L1 is independent of the core exosome [56]. By 
contrast, transcriptional silencing at soloLTR loci is mediated by 
both RRP6L and the core exosome [24]. In addition, RRP6L1 
and the exosome core complex have a common role in 21-nt 
siRNA-dependent posttranscriptional silencing (PTGS), since 
downregulation of either RRP41 or RRP6L1 alone is sufficient 
to enhance PTGS in the sensitive Hcl -GUS reporter system that 
was used in this study [50]. Hence, the role of RRP6L1 in PTGS is 
likely linked to exosome-mediated RNA degradation, suggesting 
that HEN2 and RRP6L1 are involved in at least one similar 
function. 

In animals and fungi, a single MTR4 protein is present in 
nucleoplasm and nucleoli, and essential for both processing/ 
degradation of rRNA precursors and the elimination of all other 
nuclear exosome substrates [8-10,57]. However, both yeast and 
human MTR4 proteins are incorporated in more than one 
exosome activator/ adapter complex. Yeast MTR4 is detected in 
TRAMP4 and TRAMP5 (for TRF4/5 AIRl/2 MTR4 Polyad- 
enylation), each of which comprises a RNA binding protein and a 
non-canonical poly(A) polymerase [58-61]. Although TRAMP4 
and 5 have a similar composition and many redundant functions, 
TRAMP5 seems be more important for the polyadenylation of 
pre-rRNAs while TRAMP4 might be more important for the 
degradation of other non-coding RNAs and intergenic transcripts 
[62-64]. The functional specialization between nucleolar and 
nucleoplasmic exosome activator complexes is clearer in human. 
In nucleoli, hMTR4 is incorporated in a TRAMP-like complex 
which polyadenylates rRNA maturation by-products [9]. In the 
nucleoplasm, hMTR4 is associated with the NEXT (for Nuclear 
EX osome Targeting) complex, which targets PROMPTS ( PRO- 
Moter uPstream TranScripts) for degradation by the exosome [9] . 
Hence, both yeast and animals possess nucleolar and nucleoplas- 
mic exosome activators, which share MTR4 as a central 
component. By contrast, an exosome activating system with two 
specialized RNA helicases has evolved early in the green lineage. 
Interestingly, both the nucleoplasmic fraction of human MTR4 
and the Amhidopsis nucleoplasmic-specific RNA helicase HEN2 
appear associated with similar RNA binding proteins to form 
NEXT and NEXT-like complexes, respectively, and with the cap- 
binding complex [9,21, this study]. These findings suggest a high 
degree of functional conservation between the nucleoplasmic 
fraction of human MTR4 and plant HEN2. By contrast, a 
TRAMP-like complex comprising a non-canonical poly(A) poly- 
merase remains to be identified in plants. Hence, the emerging 
picture is that only the core exosome machinery is conserved in all 
eukaryotes, while exosome-associated activities and activating 
complexes show intriguing diversity and complexity in fungi, 
insects, animals and plants. 

Methods 

Plant material 

With the exception oihen2-l [12] used for the S-PTGS assay, 
all Arahidopsis thaliana plants were of Columbia ecotype (Col-0). 
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T-DNA insertion lines were retrieved from NASC (http:// 
arabidopsis.info/). mtr4-l, mtr4-2, hen2-2 and hen2-4 lines are 
described in [13]. RRP41 RNAi lines are described in [5]. The S- 
PTGS reporter line Hcl was first described in [51]. Hcl/xrn/col 
lines are described in [49]. Unless stated otherwise, plants were 
grown on soil at 20°C with cycles of 16 h light/8 h darkness. 

Sequence analysis and phylogeny 

Sequences were retrieved from Phytozome (http://www. 
phytozome.net), Metazome (http://www.metazome.net) andJGI 
(http://genome.jgi.doe.gov) genome databases, using AtMTR4, 
HEN2 and AtSKI2 proteins as BLAST queries. Structures of 
AtMTR4 and HEN2 were modeled with MODELLER Software 
(http:// modbase.compbio.ucsf edu/ModWeb20-htmI/ modweb.ht 
ml) using the crystal structures of S. cerevisiae MTR4 (PDB 3L90 
and 2XGJ) [34,35] as templates. Alignments were performed with 
Chimera (http://www.cgl.ucsf edu/chimera, for structure-based 
sequence alignments) and ClustaJX (http://www.clustal.org, for 
phylogenetic analysis). The phylogenetic tree was calculated with 
the neighbor-joining algorithm built in ClustalX and 1000 
bootstraps, and drawn with Figtree (http://tree.bio.ed.ac.uk/ 
software/ figtree). 

Expression of GFP and myc-tagged fusion proteins 

For the expression of RRP4, AtMTR4 and HEN2 GFP fusion 

proteins under the control of the 35S CaMV promoter, the coding 
sequences of RRP4, AtMTR4 and HEN2 were amplified from 
cDNA and cloned into vector pK7FVVG2 [65] . For expression of 
GFP-tagged or myc-tagged RRP41 the genomic sequence of 
RRP41 including 1 kb upstream of the RRP41 gene was cloned 
into vectors pGWB604 and pGWB616, respectively [66]. For 
immunoprecipitations (see below), AtMTR4-GFP was expressed 
under the control of its own promoter. To do so, a genomic region 
comprising 1 kb upstream of the MTR4 gene, the first two exons 
and the first intron wcic, fused to the CDS downstream of the 
second exon and cloned into pGWB604 [66]. Constructs that 
aUow the expression of RFP-tagged FIBl [67], XRN2 and 
SRP43a [68] were kind gifts of Jane Brown and Martin Crespi, 
respectively. Infiltration of A', henlhmniana leaves was performed 
as described in [50] except that Pi 9 \vas used as a suppressor of 
silencing. Ai'ahidopds plants were transformed by the floral dip 
method [69]. Root tips of stable transformants were examined 8 
days after germination by confocal microscopy. 

Co-immunoprecipitatlon and mass-spectrometry analysis 
RRP41-myc, RRP41-GFP, AtMTR4-GFP and HEN2-GFP- 
fusion proteins were extracted from flowers of stable Arabidopsis 
transformants and purified using magnetic microparticles coated 
with monoclonal myc or GFP antibodies (MACS purification 
system, Miltenyi Biotech) according to the manufacturer's 
instructions except that SDS was omitted from washing buffers. 
Co-IP experiments were carried out in triplicates for RRP41 and 
MTR4 and duplicates for HEN2 witii 50 mM and 150 mM NaCl. 

For in-gel digestion, samples were separated by SDS-PAGE 
followed by trypsic digestion and peptide extraction as described in 
[70]. Otherwise, proteins were eluted directiy from magnetic 
beads in 1 x Lacmmli buffer, precipitated with 100 mM ammo- 
nium acetate in methanol, and resuspended in 50 mM ammonium 
bicarbonate. After reduction and alkylation steps with 5 mM 
dithiothreitol and 10 mM iodoacetamide, respectively, proteins 
were digested overnight with trypsin 1/25 (w/w). Vacuum dried 
peptides were re-suspended in 15 nl 0.1% FA (solvent A). One 
third of each sample was injected on a NanoLC-2DPlus system 
(nanoFlex ChiP module; Eksigent, ABSciex, Concord, Ontario, 



Canada) coupled to a TripleTOF 5600 mass spectrometer 
(ABSciex) operating in positive mode. Peptides were loaded on 
C18 columns (ChIP C-18 precolumn 300 |J,m ID X 5 mm 
ChromXP and ChIP C-18 analytical column 75 ]Xm ID x 
15 cm ChromXP; Eksigent) and were eluted using a 5%-40% 
gradient of solvent B (0.1% FA in Acetonitrile) for 60 minutes at a 
300 nl/min flow rate. The TripleTOF 5600 was operated in high- 
sensitivity data-dependant acquisition mode with Analyst software 
(vl.6, ABSciex) on a 350-1250 m/z range. Up to 20 of the most 
intense multiply-charged ions (2+ to 5+) were selected for CID 
fragmentation, with a cycle time of 3.3s (TOP 20 discovery mode). 

For protein identification, raw data were converted to Mascot 
Generic File format (mgl) and searched against a TAIR 10 
database supplemented with a decoy database build from reverse 
sequences. Data were analyzed using Mascot algorithm version 2.2 
(Matrix Science, UK) through ProteinScape 3.1 software (Bruker). 
Search parameters allowed N-acetylation (protein N-terminal), 
carbamidomethylation (C) and oxidation (M) as variable peptide 
modifications. Mass tolerances in MS and MS/MS were set to 
20ppm and 0.5Da, respectively. 2 trypsin mis-cleavages were 
aUowed. Peptide identifications obtained from Mascot were 
validated with a FDR <1%. A second algorithm, PEAKS DB 
(version 5.3, BSI Informatics) was used with the same search 
parameters to strengthen the identifications. Identified proteins 
were assessed by the total number of fragmented spectra per 
protein (spectral count). 

Data of three (RRP41, MTR4) or two (HEN2 50 mM NaCl, 
HEN2 150 mM) replicates were crossed. Protein partners were 
considered only if present in all co-IP replicates. AU proteins 
observed in the corresponding control replicates, including same- 
set and sub-set proteins, were discarded. A second filter was set 
using controls from 15 independent other IP experiments in A. 
thaliana carried out by other laboratories in the same MS facility. 
All proteins observed in any of these nc'gati\ e controls were 
discarded from the final lists of partner proteins. 

Go-term analysis of proteins co-purified with MTR4-GFP was 
performed with DAVID (http://david.abcc.ncifcrfgov) [71,72]. 

qRT-PCR analysis 

Plants were grown on MS agar plates supplemented with 0.5% 
sucrose. Plates for induction of RRP41 RNAi contained 8 |J,M 
17P-estradiol [5]. For each target, at least three biological 
replicates from WT, mtr4-l, mtr4-2, hen2-2, hen2-4, RRP41 
non-induced [RRP41 Ctrl) and RRP41 induced [RRP41 RNAi) 
were analyzed. Total RNA was isolated from 7 day-old seedlings 
using TRI-reagent (MRC). cDNA was synthesized from 5 |Xg of 
total RNA with Superscript III reverse transcriptase (Invitrogen) 
using 37.5 pmol of oligo(dT) per 20 |il reaction according to the 
manufacturer's instructions. Samples were ancilyzed as technical 
triplicates in a LightCycler 480 Real-Time PGR System (Roche). 
Each qRT-PCR reaction contained 1 x LightCycler 480 SYBR 
Green I Master Mix (Roche), 5 pmol of each primer, 0.5 |J,1 of 
cDNA in a volume of 10 ixl. ACT2, TIP41 and EXP were used as 
reference mRNAs. 

Microarray analysis 

Wild type, hen2 and 'mlr4 plants were grown on MS agar 
supplemented with Q.y/o sucrose. Total RNA was extracted from 
two biological replicates using Nucleospin RNA plant columns 
(Machery & Nagel). cDNA synthesis and labeling with Cy3-dUTP 
or Cy5-dUTP (Perkin-Elmer-NEN Life Science Products) for 
fluorochrome reversal was performed as described previously [73]. 
Samples were hybridized to NimbleGen whole genome microrrays 
(Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/), ac- 
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cession no. GPL17057 and GPL11005) as described in [73]. 
Topological positions and nucleotide sequence of tiie 1,434,492 
strand-specific NimbleGen probes are available in the FLAGdb"'''^ 
database at http://urgv.evry.inra.fr/FLAGdb++ [45]. Two mi- 
cron scanning was performed with an InnoScan900 scanner and 
raw data were extracted using Mapix software (Innopsys). 

Statistical analysis 

Probes that mapped to repetitive sequences of the Arahidopsis 
genome were excluded from the analysis. Statistical analyses were 
performed with the software R. For each experiment, the raw data 
comprised the logarithm of median feature pixel intensities at 
wavelengths 635 nm (red) and 532 nm (green), respectively. The 
dye bias was corrected by a global intensity-dependent normal- 
ization for each chromosome and each array using the loess 
procedure [74] and then averaged over the technical replicates. 
The outputs of this procedure are two normalized intensity values 
per probe, one for each of the co-hybridized samples. Normalized 
raw data were deposited at Gene Expression Omnibus (http:// 
www.ncbi.nlm.nih.gov/geo/), accession no. GSE48178, and at 
CATdb (http://urgv.evry.inra.fr/CATdb/), accession no. TIL- 
Ath-2011_3. 

For the statistical analysis, each mutant sample was compared 
to the co-hybridized wild type sample. Analyses were performed 
for each biological replicate and independently for each 
chromosome and each strand (2 mutants x 2 biological repli- 
cates x2 strands x5 chromosomes = 40 analysis were performed). 
To determine probes that behave differently between a mutant 
and a wild type sample, we recast the question as an unsupervised 
classification problem for each biological replicate, for each 
chromosome and for each strand. A Hidden Markov Model was 
developed to model the joint distribution of the two normalized 
hybridization intensities in order to distinguish four different 
biologically interpretable clusters of probes: one cluster with 
similar behavior in both samples (identically expressed), one 
cluster with higher intensity in the first sample (over-expressed), a 
symmetric cluster with lower intensity in the first sample (under- 
expressed) and one cluster with low intensities in both samples 
(noise) corresponding to the non-transcribed probes. The 
emission distribution of the noise cluster was modeled by a 
spherical Gaussian. A Gaussian mixture, components of which 
were forced to be colinear along the main axis of the ellipse 
representing each cluster, modeled each of the three other 
clusters. Data projection on the main axes allowed us to work 
with unidimensional mixtures and to put a unique Gaussian 
distribution along the associated perpendicular axis for all 
components. Model parameters were estimated using an adaptc-d 
version of the EM algorithm taking the model constraints and the 
spatial dependency between probes into account. A detailed 
documentation of the statistical analysis can be found in dataset 
S2. 

Probe classification into the four clusters was based on the 
conditional probabilities: a probe was assigned in the cluster for 
which the conditional probability was the highest (MAP rule). A 
probe was declared over-expressed in the mutant if this 
assignment was observed in the two biological replicates. 1860 
probes were identified as overexpressed in hen2 mutants, and 499 
probes were assigned as overexpressed in mtr4 mutants. For the 
majority of the estimated models, the main axis of the ellipse 
representing the cluster of probes under-expressed in the mutant 
was very closed to the main axis of the cluster with identically 
expressed probes. As a consequence, only a small number of 
probes were assigned under-expressed. Intersection of the lists 
with under-expressed probes across the two biological replicates 



revealed that no probes were underexpressed in both replicates of 
hen2 or mtr4, respectively. 

Bioinformatics analysis 

A file allowing the visualization of the upregulated probes 
aligned to the Arahidopsis genome using seqmonk software 
(http://www.bioinformatics.babraham.ac.uk/projects/seqmonk/) 
can be found in dataset SI. Probes with a unique localisation in 
the genome and for which a significant over-expression was 
detected by the statistical analysis were sorted by genome 
coordinates to identify upregulated regions. Only regions with at 
least two consecutive probes were allowed. Bioinformatic analysis 
was performed with adapted perl scripts. Sequences of upregulated 
regions were armotated using TAIRIO genome database, 
FLAGdb"^, and recent studies that identified snoRNA, miRNA 
genes and line RNA genes [45-47,75]. Sequence alignments were 
performed with BLASTn and sim4 tools (for annotation of spliced 
transcripts) [76,77]. Only hits with 100% identity were considered. 
Annotation of upregulated regions was manually curated to 
remove double assignments (e.g. a region that matches to snoRNA 
genes located in an intron of a protein coding genes was assigned 
as snoRNA and removed from the list of introns). For comparison 
with a previously published list of exosome substrates, we 
compared the genome coordinates of upregulated regions with 
coordinates of upregulated regions extracted from [5], taking the 
difference between different versions of TAIR into account. 

Accession numbers (Arabidopsis Genome Initiative, AG!) 

ATMTR4: Atlg59760; HEN2: At2g06990; RRP41: 
At3g61620. 

Supporting Information 

Dataset SI Seqmonk file to visualize the location of significantiy 
upregulated probes in the Arabidopsis genome, and instructions 
for opening the file using the free seqmonk software. 
(ZIP) 

Dataset S2 Detailed documentation of the statistical analysis of 

the tiling data. 

(PDF) 

Figure SI Plant MTR4 proteins have an insertion in the inner 
loop of the arch domain. Top: Sequences of MTR4 and HEN2 
proteins from selected plant species were aligned to the sequence of 
S. cerevisiae MTR4 (highlighted in orange). The alignment is shown 
for 50 aminoacids of the arch domain. Athaliana, Arabidopsis 
thaliana (thalc cress); Thalophila, Thelluniella halophila (Salt cress); 
Sitalica, SeUiria ilalica (Foxtail millet); Mguttatus, Mimulus guUalus 
(Monkey flower); Pvulgaris, Phaesolus vulgaris (Common bean); 
Mtrunculata, Medicago trunculata (Barrel medic); Mesculenta, 
Manihot esculenta (Cassava); Ptrichocarpa, Populus trichocarpa 
(Poplar); Csativus, Cucumis sativus (cucumber); Egrandis, Eucalyp- 
tus grandis (Eucalyptus); SmoeUendorfii, Selaginella moelkndorfii 
(Spikemoss); Ppatens, Physcomilrella palens (Moss); Scerevisiae, 
Saccharomyces cerevisiae (Bakers yeast). Left: Model of the arch 
domains of AtMTR4 and HEN2. AtTvITR4 and HEN2 structures 
were modeled using the yeast MTR4 structure as template. Only 
the arch domain is shown (from the top). Yeast MTR4 in orange, 
HEN2 in green, AtMTR4 in blue. A dashed red line indicates the 
insertion of 9 amino acids present in AtMTR4. 
(PDF) 

Figure S2 Full sequence alignment of MTR4 and HEN2 
proteins from selected species. Amino acids are colored by the 
ClustalX color scheme. Boxes below the alignment illustrate RecA 
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domains (blue), the winged helix domains (yellow), tiie arch 
domain (red, the black box depicts the KOW domain) and the C- 
terminal bundle domain (pink). Characteristic differences between 
plant MTR4 and HEN2 sequences are marked by stars. 
Athahana, Arahidopsix ihaliana (thale cress); Thalophila, Thellu- 
niella halophila (Salt cress); Sitahca, Selaria italica (Foxtail millet); 
Mguttatus, Mimulus guttatus (Monkey flower); Pvulgaris, Phaeso- 
lus vulgaris (Common bean); Mtrunculata, Medicago trunculata 
(Barrel medic); Mesculenta, Manihot esculenta (Cassava); Ptricho- 
carpa, Populus trichocarpa (Poplar); Csativus, Cucumis sativus 
(cucumber); Egrandis, Eucalyptus grandis (Eucalyptus); Smoellen- 
dorfii, Selaginella moellendorfii (Spikemoss); Ppatens, Physcomi- 
trella patens (Moss). 
(PDF) 

Figure S3 MTR4-GFP co-localises with nucleolar marker 
proteins. Transient expression of fluorescent fusion proteins in 
Nicotiana benthamiana leaves. MTR4-GFP is shown in green, 
RFP-fusion proteins are shown in red. FibriUarin-RFP and XRN2- 
RFP are known nucleolar markers; SRP34a-RFP was used as a 
nucleoplasmic marker. The phase contrast picture is shown on the 
right. Scale bars: 15 \lra. 
(PDF) 

Figure S4 HEN2-GFP co-localises with a nucleoplasmic marker 
protein. Transient expression of fluorescent fusion proteins in 
Nicotiana benthamiana leaves. HEN2-GFP is shown in green, 
RFP-fusion proteins are shown in red. FibrUlarin-RFP and XRN2- 
RFP were used as nucleolar markers; SRP34a-RFP was used as a 
nucleoplasmic marker. The phase contrast picture is shown on the 
right. Scale bars: 15 \lm. 
(PDF) 

Figure S5 MTR4 and HEN2 have distinct localization patterns. 
The distribution of the indicated GFP-fusion proteins in root cells 
of stable Arabidopsis transformants is shown on the left. The 
middle column shows DAPI staining. Please note that take-up of 
DAPI by intact, living plant tissue is slow and can lead to a strong 
background signal from cell walls. No, Nucleolus; Np, Nucleo- 
plasm; Cp, Cytoplasm. Scale bars: 5 urn. 
(PDF) 

Figure S6 HEN2-GFP is localized in nucleoplasmic foci. Co- 
expression of IIEN2-GFP and the nucleoplasmic marker protein 
SRP34a in leaves of stable Arabidopsis transformants. Nucleoplas- 
mic foci were observed in all cell types of all stable transformants. 
Scale bars: 15 (im. 
(PDF) 

Figure S7 Sequence alignment of human RBM7 and Arabi- 
dopsis At4gl0110. 
(PDF) 

Figure S8 Supplemental information about selected known 

exosome substrates from Fig. 4. 

(DOCX) 

Figure S9 Accumulation of unsphced transcripts in hen2 
mutants. qRT-PCR. A Diagram of the genomic locus indicated 
by the respective AGI number is shown at the top of each panel. 
Annotated mRNA genes are represented as arrows with dark blue 
boxes for the CDS, light blue boxes for 3' and 5' UTRs, and a 
light blue hne for introns. Red bars above the diagram represent 
probes detected in the microarray analysis. Green arrows above or 
below the diagram depict the location of qRT-PCR primers. The 
corresponding qRT-PCR results for each primer pair are given as 
fold-change relative to WT in the histograms below each diagram. 
mtr4-l in red, mtr4-2 in orange, hen2-2 in light green, hen2-4 in 



dark green, RRP41 control in light grey, RRP41 RNAi in dark 
grey. Error bars = SD in three biological replicates. A. At3g26510 
(with 4 predicted splice variants). qRT-PCR results suggest that 
hen2 and RRP41 RNAi plants accumulate a population of 
transcripts some of which still contain the unspliced acceptor site 
of the first intron (panel V 11 4a/b) and some of which still contain 
the 2nd intron (panel V112a/b). B. Atlg58602. Transcripts 
comprising the unsphced 2nd exon/intron donor site accumulate 
in hen2 and RRP41 RNAi plants. C. At3g43160. Both spliced 
and unsphced transcripts corresponding to the 3' region of the 
At3g43160 locus accumulate in hen2 and RRP41 RNAi plants. 
(PDF) 

Figure SIO Unspliced transcripts from the Atlg7927() locus are 
polyadenylated. Sequences of 3' RACE PGR products obtained 
from hen2-4 samples. cDNA synthesis was initiated using oligo-dT 
as primer. 3' RACE PGR was performed with VI 1 3a (green arrow) 
as forward primer, and the adapter sequence of the oligo-dT primer 
as a reverse primer. PGR products obtained from hen2-4 samples 
were cloned and sequenced. The genomic sequence is given above 
the line, with intronic sequence in purple. Red arrows mark donor 
and acceptor splice sites. Non-encoded nucleotides are in green. 
(PDF) 

Figure Sll Upregulation of mRNAs in hen2 mutants. Accu- 
mulation of mRNAs was tested by qRT-PCR using primer pairs 
located in 5 ' , central or 3 ' regions of the annotated transcripts as 
indicated below each panel. mtr4-l in red, inlr4-2 in orange, 
hen2-2 in light green, hen2-4 in dark green, RRP41 txjntrol in 
light grey, RRP41 RNAi in dark grey. Error bars = SD in three 
biological replicates. 
(PDF) 

Figure S12 mRNAs are not systematically detected in all 
replicates. Accumulation of mRNAs was tested by qRT-PCR using 

primer pairs located in 5 ' or 3 ' regions of the annotated transcripts 
as indicated below each panel. Panels on the left show the qRT- 
PCR results for exactly the same samples that have been used for 
hybridisation to the tiling arrays. Panels on the right show the results 
obtained in 3 independent replicates grown in the same culture 
conditions. A possible explanation for the inconsistence between the 
replicates could be that mRNAs are not bona-fide substrates of 
exosome-mc'diat(-d RNA degradation and might rather be upregu- 
lated due to indirect effects. Other types of transcripts such as short 
mRNA-derived regions, introns, unspliced transcripts or several 
types of non- coding RNAs are consistenfly observed in all 
replicates. mlr4-l in red, mlr4-2 in orange, hen2-2 in light green, 
hen2-4 in dark green, RRP41 control in light grey, RRP41 RNAi 
in dark grey. Error bars = SD in three biological replicates. 
(PDF) 

Figure S13 Loss of HEN2 or the exosome is associated with 
increased levels of polyadenylated snoRNA precursors. Oligo-dT 
primed cDNA was used for 3' RACE- PGR, with primer E5 and 
the adapter sequence of the cDNA synthesis primer as forward 
and reverse primers, respectively. PGR products were separated 
on 2% agarose gels, transferred to Hybond XL membranes, and 
hybridized with radiolabeled probes E6 (mid panel) and E8 (lower 
panel). The diagram below illustrates location of primers and 
probes with respect to the snoRNA genes in this region (see also 
Fig. 9). 
(PDF) 

Figure S14 Polyadenylated transcripts partially antisense to 
AT5G44306. Sequences were amplified by 3' RACE from oligo- 
dT primed cDNA from the indicated samples using a primer 
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(indicated hy the purple arrow) situated antisense to the 5' region 
of AT5G44306. Non-encoded nucleotides are in green. 

(PDF) 

Figure S15 mtr4 mutants accumulate exosome substrates to lower 
levels than hen2 mutants. Boxplot showing averaged intensity values 
for the overexpressed probes identified in each comparison of the 
tUing array analysis. The first two rows show the intensity values in 
WT and hen2 samples for all probes overexpressed in both biological 
replicates of hen2. The third and fourth rows show tlu- intensity- 
values in WT and mtr4 samples for all probes overexpressed in both 
biological replicates ofmtr4. The averaged intensities in ■mtr4 samples 
are significant lower than the averaged intensities in hen2 samples (p- 
value<le -3). The fifth and sixth rows show the intensity values in 
hen2 and mlr4 for the common probes (overexpressed in both 
mutants). Again, the mean average intensity in mtr4 samples is lower 
than the mean average intensity in hen2 samples (p-value<le -3). 
(PDF) 

Figure S16 The microarray analysis probably underestimates the 
contribution of FIEN2 to nuclear RNA surveillance. A diagram of 
the genomic locus indicated by the respective AGI number is shown 
at the top of each panel. Annotated mRNA genes are represented as 
arrows with dark blue boxes for the CDS, light blue boxes for 3' and 
5' UTRs, and a light blue line for introns. Red bars above the 
diagram represent probes detected in the microarray analysis. 
Green arrows above or below the diagram depict the location of 
qRT-PCR primers. The corresponding qRT-PCR results for each 
primer pair are given as fold-change relative to WT in the 
histograms below each diagram. mtr4-l in red, mtr4-2 in orange, 
hen2-2 in light green, hen2-4 in dark green, RRP41 control in light 
grey, RRP41 RNAi in dark grey. Error bars = SD in three 
biological replicates. Upper panel: The upregulation of two 
stretches in the 5' region of Atlg20100 in RRP41 RNAi lines 
(indicated by the red double arrows) was detected in a previous tiling 
microarray study [5] . Our microarray array detected only a portion 
of this region (indicated by the red bars above the diagram). 
However, we could confirm the upregulation of the uppermost 
600 kb of Atlg20100 in bodi hen2 aUeles by qRT-PCR. Middle 
panel: A portion of the fifth intron of At5g27720 was previously 
identified as a target of the exosome core complex [5] (indicated by 
the red double arrows). In our microarray analysis, only one probe 
in this region was declared statistically significant. Since we 
considered only regions with at least two consecutive probes, this 
region was omitted from the data interpretation. Nevertheless, 
qRT-PCR data show that a portion of the fifth intron is upregulated 
in hen2 mutants, while levels of pre-mRNA or mature mRNAs are 
similar to WT. The presence of polyadenylated transcripts 
corresponding either to the entire intron or to shorter degradation 
intermediates in hen2 and RRP41 RNAi samples was further 
confirmed by cloning of 3' RACE products (not shown). Lower 
panel: A previous tiling study [.^] identified a short stretch derived 
from the 5' region of At4g02890 as a target of the exosome core 
complex (indicated by the red double arrows). This region was not 
detected in our tiling analysis, but its upregulation was easily 
detected by qRT-PCR. The discrepancy between the two tiling 
studies is probably largely explained by th(; different arrays designs 
(the NimbleGen 732K array is not designed to detect very small 
regions) and by different statistical analysis procedures. Moreover, 
the accumulation of exosome targets may also vary with growth 
conditions. In fact, several of our validated targets have not been 
picked up in the previous genome-wide analysis [5] , indicating that 
both tiling studies under-estimate the contribution of exosome- 
mediated RNA surveillance. 
(PDF) 



Figure S17 A fraction of AtMTR4-GFP can be detected in the 
nucleoplasm. Intracellular cUstribution of AtMTR4-GFP in root cells 
of a stable Arabidopsis transformant. The nucleoplasmic fraction of 
AtN'ITR4-GFP (white arrows) is more visible in individual 
transformants displaying relative weak transgene expression, which 
is not representative for the majority of the investigated plant lines. 
(PDF) 

Table SI Mass-spectrometric analysis of RRP41 IP experi- 
ments: list of peptides. 

PCLSX) 

Table S2 Mass- spectrometric analysis of MTR4 IP experi- 
ments: list of peptides. 

(XLSX) 

Table S3 Mass- spectrometric analysis of HEN2 IP experiments: 

hst of peptides. 

(XLSX) 

Table S4 Analysis of microarray data. Regions corresponding to 
short stretches of mRNAs, unspliced transcripts, and 3' or 5' 
extended mRNAs. 
(XLSX) 

Table S5 Analysis of microarray data. Regions corresponding to 

introns. 

(XLSX) 

Table S6 Analysis of microarray data. Regions corresponding to 

mRNAs. 

(XLSX) 

Table S7 Analysis of microarray data. Regions corresponding to 

transposable elements. 

(XLSX) 

Table S8 Analysis of microarray data. Regions corresponding to 
snoRNA precursors. 

(XLSX) 

Table S9 Analysis of microarray data. Regions corresponding to 
line and other RNAs. 

(XLSX) 

Table SIO Analysis of microarray data. Potential antisense 

transcripts. 

PCLSX) 

Table Sll Analysis of microarray data. Non-annotated regions. 

(XLSX) 

Table S12 Gene descriptions and biological functions for the 
mRNAs upregulated in hen2 and/or mtr4 samples. 

(XLSX) 

Table S13 Primers used in this study. 
PCLSX) 
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